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Abstract 

Electroweak production of the top quark is measured in pp collisions at a/s = 7 TeV, 
using a dataset collected with the CMS detector at the LHC and corresponding to an 
integrated luminosity of 36 pb^^. With an event selection optimized for f-channel pro- 
duction, two complementary analyses are performed. The first one exploits the spe- 
cial angular properties of the signal, together with background estimates from data. 
The second approach uses a multivariate analysis technique to probe the compatibil- 
ity with signal topology expected from electroweak top quark production. The com- 
bined measurement of the cross section is 83.6 ± 29.8 (stat. + syst.) ± 3.3 (lumi.) pb, 
consistent with the standard model expectation. 

Submitted to Physical Review Letters 



*See Appendix A for the list of collaboration members 



1 



Electroweak theory predicts three mechanisms for single top quark production in hadron- 
hadron collisions: i-channel, s-channel, and tW (or W-associated) production. Single-top events 
have been observed by the DO and CDF experiments at the Tevatron pp collider [1 -3J, and first 
measurements of individual channels have recently been reported ||4l|5l. In proton-proton col- 
lisions at 7 TeV, f-channel single top quark production. Fig. [l| has the largest cross section and 
the cleanest final-state topology, because of the presence of a light jet recoiling against the single 
top quark. Next-to-leading order (NLO) computations with resummation of collinear and soft- 
gluon corrections at next-to-next-to-leading logarithmic accuracy predict dt = 64:.3t_l'lt\'7 pb 0^ 
for a top mass of nit = 173 GeV/c^ and with parton distribution functions (PDFs) as given in 
Ref. [7]. The first uncertainty comes from doubling and halving the renormalization and fac- 
torization scales and the second from PDF uncertainty at the 90% confidence level. 




Figure 1: Feynman diagrams for single top quark production in t channel: 2 — > 2 (left) and 
2^3 (right) processes. 

This Letter presents the first measurement of the i-channel single top quark production cross 
section in pp collisions at -\/s = 7 TeV in the decay channels t — > evb, t — >/^i/b, and t — >Ti/b 
with leptonic t decays. Two complementary measurements are performed. The first analysis 
exploits two angular observables sensitive to f-channel single top quark production: the non- 
central pseudorapidity distribution of the light jet, and the cosine of the angle between this jet 
and the final-state lepton, in the reconstructed top-quark rest frame. A multivariate analysis 
technique with boosted decision trees (BDT) [8 , 9| is used in the second method, which probes 
the overall compatibility of the signal event candidates with the event topology of electroweak 
top quark production. Hereafter, these analyses will be referred to as 2D and BDT analysis, 
respectively. 

Both analyses use a data sample corresponding to an integrated luminosity of 35.9 ±1.4 pb^^ llTO| , 
collected by the Compact Muon Solenoid (CMS) detector ||TT| operating at the Large Hadron 
Collider (LHC). The central feature of the CMS detector is a superconducting solenoid pro- 
viding a field of 3.8 T. Located within the solenoid are the silicon pixel and strip tracker, the 
crystal electromagnetic calorimeter and the brass /scintillator hadron calorimeter. Muons are 
measured in gas-ionisation detectors embedded in the steel return yoke. In addition to the 
barrel and endcap detectors, a quartz-fiber Cherenkov detector extends the jet acceptance to 
\f]\ =5, where the pseudorapidity rj is defined as ?y = — In tan where 6 is the polar angle of 
the particle or jet trajectory with respect to the counterclockwise beam direction. 

Events are selected by requiring the presence of at least one muon or electron having high trans- 
verse momentum (px)- The particle flow (PF) algorithm described in ||T2| performs a global 
event reconstruction and provides the full list of particles identified as electrons, muons, pho- 
tons, charged and neutral hadrons. A fully reconstructed isolated muon (electron) candidate 
originating from the leading primary vertex is required [,13J with pj > 20 (30) GeV/c, |^| < 2.1 
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(2.5), and a veto is applied on additional leptons passing lower thresholds. 

Jets are reconstructed using the anti-Zc^ algorithm IIT4l with a distance parameter of 0.5, clus- 
tering particles identified by the PF algorithm. Jets within the full calorimeter acceptance are 
considered, with pj > 30 GeV/c after corrections for the jet energy scale, as determined from 
simulations and collision data [,151 . The BDT analysis first identifies isolated leptons, which are 
then excluded form the jet clustering step. In the 2D analysis, possible jet-lepton ambiguities 
are resolved on the basis of the distance AR = \J (Atj)^ + (Acp)^ between the reconstructed jet 
and the nearest lepton. The event is accepted for further analysis only if exactly two jets are 
reconstructed. 

In order to reduce the large background from W + light partons, we apply a b-tagging algo- 
rithm [16| that calculates the signed 3D impact-parameter IP significance (IP/t7jp) of all the 
tracks associated with the jet passing tight quality criteria. The tracks are ordered decreasingly, 
following their value of IP/ tTjp, and a tight selection threshold is applied on the impact param- 
eter significance of the third track in the list. This threshold corresponds to a b-jet identification 
efficiency of ~40% and a misidentification rate of ~0.1% determined in data as a function of 
Pt and t] |(T6l . The 2D analysis exploits the expectation that most of the signal events, even in 
the 2 — )■ 3 process, have only one b quark inside the tracking acceptance {\rj\ < 2.4). Events 
are rejected if the jet failing the tight threshold passes a loose threshold on the IP significance 
of the second track. The loose threshold corresponds to an efficiency and misidentification rate 
of about 80% and 10%, respectively. The BDT analysis applies no veto on the second b-tagged 
jet, and rejects events where the jets are back-to-back, which are found to be poorly reproduced 
by the W + jets simulation. To further suppress contributions from processes where the muon 
(electron) does not come from the decay of a W boson, we require a transverse mass of the W 
boson Mt > 40 (50) GeV/c^, where the transverse missing energy (£^'^®) from the PF algorithm 
is used as a measurement of the pj of the undetected neutrino. 

The 2D analysis selects 112 (72) events in the muon (electron) decay channel, while the BDT 
analysis selects 139 (82). In both analyses a signal purity of around 18% (16%) is expected in 
the muon (electron) decay channel. The main backgrounds are tt, Wbb, W + light-partons, 
Wc, tW, and processes where the lepton does not originate from a W/ Z, hereafter called QCD 
events. 

The i-channel events from Monte Carlo simulation used in this study have been generated 
with the MadGraph 4.4 event generator |fT7il . To give a fair approximation of the full next- 
to-leading order properties of the signal, we combine the dominant NLO contribution (2— >3 
diagram qg q'tb and its charge conjugate) with the leading order diagram (2^2, qb q't) 
by a matching procedure based on Ref. I[T8| . MadGraph is used also for tt, single top s 
and tW channels, and W/Z + jets. The remaining background samples were simulated us- 
ing Pythia 6.4.22 imi; these include di-boson production (WW, WZ, ZZ), 7 + jets, multi-jet 
QCD enriched in events with electrons or muons coming from the decay of b and c quarks or 
muons from the decay of long-lived hadrons, and particles with large probability to leave a 
high-energy electromagnetic deposit. The CTEQ 6.6 PDF sets |20| are used for all simulated 
samples. All generated events undergo a full simulation of the detector response based on 
GEANT4 [21J. 

The NLO theoretical prediction is used to normalize the single-top production in s and tW 
channels I|22ll23l and di-boson processes [24]. The tt cross section is normalized to 150 pb, with 
uncertainty constrained to the result of a dedicated analysis. The same analysis constrains the 
^QQ {V = W,Z and Q = b, c) and Wc components, obtaining in particular a factor of 2±1 for 
Wbb over the LO prediction. 
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The QCD yield is estimated from the same data set by a maximum Hkelihood fit to the Mj 
distribution after all other selection criteria have been applied. The Mj distribution for QCD 
events is taken from a control sample obtained by inverting the lepton isolation requirement. 
The latter requirement rejects most of the signal-like events (single top, W/Z + jets, tt) leaving a 
QCD-dominated sample. The distribution for the sum of all non-QCD processes is taken from 
simulation. The uncertainty on this estimate is conservatively estimated such as to cover the 
differences observed when varying the fit range and the QCD shape. 

The BDT analysis normalizes the result of the W + jets simulation to the inclusive W cross sec- 
tion at NNLO [24 J, while collision data are used in the 2D analysis to extract the normalization 
of the W + light-partons background. Two control samples are used, orthogonal to the stan- 
dard selection. Control sample 'region-A', dominated by the W + light partons background, is 
defined by the requirement of one isolated lepton and exactly two jets, one of which is required 
to be within the tracker acceptance and with at least two tracks satisfying the quality selection 
of the b-tagging algorithm. Both jets should fail the tight b-tagging selection. A second control 
sample, 'region-B', is defined as a subset of the former where at least one jet passes the loose 
b-tagging selection although it fails the tight one. In both samples a fit of the Mj distribution 
is performed, allowing the QCD and W + light-partons background to float, while all other 
processes, including heavy-flavour contributions and the f-channel signal, are constrained to 
their expected values. A scale factor of 1.27 in the muon and 1.05 in the electron decay channel 
is observed between the number of W + light-partons events obtained from the fit in sample 
region-B and the predictions from simulation. These scale factors are used to obtain the central 
value of the predicted background. A ±30% (±20%) uncertainty is assigned on the muon (elec- 
tron) scale factor, covering both the statistical uncertainty from the fit, the difference between 
the background predictions obtained from the two control samples, region-A and region-B, 
and between data and simulation results for both samples. The normalization of Z + jets back- 
ground is rescaled by the same factor as that for the W + light-parton background. 

A top-quark candidate is reconstructed in each event by pairing the b-tagged jet with a W- 
boson candidate. The latter is reconstructed by imposing the W-boson mass as a kinematic 
constraint, leading to a quadratic equation in the longitudinal neutrino momentum, pz,v When 
two real solutions are found the smallest | pz,v \ is taken, and for complex solutions the imaginary 
component is eliminated by modifying E^^^ and E^^'^ independently, such as to give Mj = 

Mw Ilia. 

In the 2D analysis a two-dimensional maximum likelihood fit is performed. One of the two fit 
variables is the cosine of the angle 6* between the direction of the outgoing lepton and the spin 
axis, approximated by the direction of the untagged jet, in the top-quark rest frame I|26ll27l. 
This observable has a distinct slope in signal events, coming from the almost 100% polarization 
of the top quark due to the V — A structure of the electroweak interaction Il28l . This property 
holds true also in many theories beyond the standard model (SM) [29]. The other fit variable 
is the pseudorapidity distribution of the untagged jet, ?yiight jet/ interpreted as the light quark 
jet recoiling against the single top, whose characteristic r] distribution allows a discrimination 
against the typically central jets from the main background processes. The distributions in 
cos 9* and '/ught jet ^re shown in Fig.|2]for events passing the 2D selection. 

The inputs to the fit are the distributions for signal and backgrounds in the cos 0*-?/iight jet plane, 
separately in the muon and electron decay channels. The overall background is allowed to float 
unconstrained in the fit, while its relative components are fixed according to the background 
estimates. The QCD and W + light-partons shapes are taken from the anti-isolated and region- 
A control samples described above, respectively, while all others are taken from the simulation. 
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Figure 2: Cosine of the angle between charged lepton and untagged jet (cos 9*, top panel) 
and pseudorapidity of the untagged jet (//Hght jet/ bottom panel) after the 2D selection, for both 
electron and muon decay channels. QCD and W + light-partons events are normalized to 
data, tt, VQQ {V = W,Z and Q = b,c), and Wc are normalized to the result of a dedicated 
measurement, all other processes are normalized to theoretical expectations. 



5 



The BDT method combines a given set of observables into one single classifier variable bdt. A 
total of 37 observables have been chosen. Their selection has been inspired by the DO anal- 
ysis [30j and optimised for the LHC kinematics. The most discriminant ones are the lepton 
momentum, the mass of the system formed by the reconstructed W boson and the two jets, the 
Pj of the system formed by the two jets, the pj of the jet passing tight b-tagging requirements, 
and the reconstructed top-quark mass. The validity of the description of all the input variables 
in the simulation has been checked using a Kolmogorov-Smirnov test in a W-enriched control 
sample with no b-tagged jet, shown in Fig. |3] (top). The bdt classifier has been validated both 
in simulation and in data: negligible differences are found by comparing its distribution for 
signal events with MadGraph, SinGLETOP [18], and MC@NLO 3.4 ||3U, and for tt events with 
MadGraph, pythia and mc@NLO. In the W-enriched control sample the distribution of bdt 
from the simulation is statistically compatible with data. 

The cross section is extracted from binned bdt distributions using a Bayesian approach. The 
normalizations of the backgrounds and the other systematic uncertainties are treated as nui- 
sance parameters. The measured distribution of the classifier bdt is shown in Fig. |3] (bottom). 

The following sources of systematic uncertainties are common to both analyses: background 
normalization; jet energy scale [15 1, propagated coherently to the E^^^^ measurement; calibra- 
tion of the unclustered energy deposits contributing to £™®®, varied by ±10%; b-tagging and 
mistagging efficiencies |[T6l : modeling of the signal and of the main backgrounds; and a 4% 
uncertainty on the integrated luminosity IITOli . 

The uncertainty on the signal model is estimated by comparing MadGraph and SingleTop 
events with different fragmentation models. The uncertainty on the tt and W/ Z + jets models 
is determined by comparing simulated samples with varied renormalization and factorization 
scale (within half and double the nominal value, independently for tt and for W/Z + jets), 
initial- and final-state radiation parameters, and two different fragmentation models. 

The impact of pile-up is estimated by comparing the default simulated samples with no pile-up 
and dedicated samples where minimum bias interactions are superimposed with a probability 
distribution roughly corresponding to the one observed in the overall 2010 dataset. The shapes 
of the bdt classifier and of both variables used in the 2D analysis are negligibly affected. 

In the 2D analysis a conservative systematic uncertainty is assigned to the degree of correlation 
between J/ught jet arid cos 9* (estimated as 6% from simulation) by comparing to the result ob- 
tained using the product of uncorrelated one-dimensional distributions for the signal. The W + 
light-partons background shapes in //ught jet arid cos 9* are extracted from data in the 2D analy- 
sis, and studies with simulated events show that the shapes extracted from the control sample 
are statistically consistent with those in the signal region for the same process. Nevertheless, a 
small difference is observed in the flight jet shapes in the two selections for the Wc process, and 
we conservatively consider this difference as a systematic uncertainty on all W + jets processes. 

The efficiencies of the muon and electron triggers, identification, and isolation for the 2D selec- 
tion have been evaluated from data using dilepton events at the Z peak Iil3j . The uncertainties 
on these efficiencies have a negligible effect on this analysis. 

The impact of each individual source of uncertainty on both analyses has been estimated with 
an ensemble of pseudoexperiments. The dominant systematic uncertainty on the cross section 
determination comes from the b-tagging efficiency, known within ±15%, because of its large 
effect on the signal acceptance. Nevertheless, this source has a negligible effect on the shapes 
of the final discriminant variables in both analyses. Other important systematic uncertainties 
come from the signal model, the factorization /renormalization scale for W/Z + jets, the jet 
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Figure 3: Boosted decision tree discriminar\t (bdt) for both electron and muon decay channels in 
the W-enriched control sample (top panel), with simulation normalized to data, also shown for 
W + jets samples with doubled and halved renormalization and factorization scale (Q). Same 
observable after the complete BDT selection (bottom panel), with signal scaled to the measured 
cross section and all systematic uncertainties and backgrounds scaled to the medians of their 
posterior distributions. 
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Table 1: Cross section measurements by channel and by analysis. The first uncertainty is sta- 
tistical, the second systematic. An additional 4% uncertainty on the luminosity HOl for each 
measurement is not included. 



Channel 


2D analysis 


BDT analysis 


e 

}i + e 


104.1 ± 42.3 +^^:^ pb 

154.2 ± 56.0 pb 
124.2 ± 33.8 pb 


90.4 ±35.1+}^! pb 
59.2 ± 35.1 j pb 
78.7 ± 25.4 pb 



energy scale, and the Wc background. 

Table [l] shows the cross section measured by both analyses in each decay channel, corrected 
for acceptance and branching ratios. In the muon + electron combination all systematic un- 
certainties are considered fully correlated, with the exception of the uncertainty on multi-jet 
QCD obtained from data. All measurements are consistent among each other and with the SM 
expectation. 

Under the assumption that all uncertainties are Gaussian and symmetric, which is fulfilled by 
the dominant uncertainties, the 2D and BDT cross section measurements are combined with 
the BLUE technique [32j, taking into account a statistical correlation of 51% estimated with 
pseudoexperiments, and treating all the systematic uncertainties as fully correlated with the 
exceptions of those coming from estimates based on data. The combined result is cr^^P = 83.6 ± 
29.8 (stat. + syst.) ± 3.3 (lumi.) pb where the BDT analysis contributes with the largest weight 
(89%). 

The expected and observed significances, including systematic uncertainties, are estimated 
with an ensemble of pseudoexperiments. The probability of the predicted background dis- 
tributions to fluctuate to the observed data corresponds to 3.7 (3.5) Gaussian standard devia- 
tions in the 2D (BDT) analysis, combining the electron and muon decay channels, while 2.1^J 5 
(2.9lg 9) expected when assuming SM i-channel production cross section. The combined 
significance is well approximated by the BDT significance of 3.5 Gaussian standard deviations. 

The single-top cross section measurement can be used as a test of the CKM matrix unitar- 
ity 133] under the assumption that |Vtd| and |Vts| are much smaller than |Vtb|/ and therefore 

that I Vtb I = ^J ^ where (7* is the SM prediction under the | Vtb I = 1 assumption. Using the 

prior knowledge that < |V^tbP ^ 1/ at the 95% confidence level we infer the lower bound 
I Vtb I > 0.62 (0.68) from the 2D (BDT) analysis, respectively 

In summary, we confirm the Tevatron observation of single top quark production and present 
the first measurement of the f-channel single top quark production cross section in pp collisions 
at a/s = 7 TeV, finding a good agreement with the SM prediction [6J. 
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